573 research outputs found

    CAMISIM: Simulating metagenomes and microbial communities

    Get PDF
    © 2019 The Author(s). Background: Shotgun metagenome data sets of microbial communities are highly diverse, not only due to the natural variation of the underlying biological systems, but also due to differences in laboratory protocols, replicate numbers, and sequencing technologies. Accordingly, to effectively assess the performance of metagenomic analysis software, a wide range of benchmark data sets are required. Results: We describe the CAMISIM microbial community and metagenome simulator. The software can model different microbial abundance profiles, multi-sample time series, and differential abundance studies, includes real and simulated strain-level diversity, and generates second- and third-generation sequencing data from taxonomic profiles or de novo. Gold standards are created for sequence assembly, genome binning, taxonomic binning, and taxonomic profiling. CAMSIM generated the benchmark data sets of the first CAMI challenge. For two simulated multi-sample data sets of the human and mouse gut microbiomes, we observed high functional congruence to the real data. As further applications, we investigated the effect of varying evolutionary genome divergence, sequencing depth, and read error profiles on two popular metagenome assemblers, MEGAHIT, and metaSPAdes, on several thousand small data sets generated with CAMISIM. Conclusions: CAMISIM can simulate a wide variety of microbial communities and metagenome data sets together with standards of truth for method evaluation

    MOSAIC: an online database dedicated to the comparative genomics of bacterial strains at the intra-species level

    Get PDF
    BACKGROUND: The recent availability of complete sequences for numerous closely related bacterial genomes opens up new challenges in comparative genomics. Several methods have been developed to align complete genomes at the nucleotide level but their use and the biological interpretation of results are not straightforward. It is therefore necessary to develop new resources to access, analyze, and visualize genome comparisons. DESCRIPTION: Here we present recent developments on MOSAIC, a generalist comparative bacterial genome database. This database provides the bacteriologist community with easy access to comparisons of complete bacterial genomes at the intra-species level. The strategy we developed for comparison allows us to define two types of regions in bacterial genomes: backbone segments (i.e., regions conserved in all compared strains) and variable segments (i.e., regions that are either specific to or variable in one of the aligned genomes). Definition of these segments at the nucleotide level allows precise comparative and evolutionary analyses of both coding and non-coding regions of bacterial genomes. Such work is easily performed using the MOSAIC Web interface, which allows browsing and graphical visualization of genome comparisons. CONCLUSION: The MOSAIC database now includes 493 pairwise comparisons and 35 multiple maximal comparisons representing 78 bacterial species. Genome conserved regions (backbones) and variable segments are presented in various formats for further analysis. A graphical interface allows visualization of aligned genomes and functional annotations. The MOSAIC database is available online at http://genome.jouy.inra.fr/mosaic

    Mapping Proprioception across a 2D Horizontal Workspace

    Get PDF
    Relatively few studies have been reported that document how proprioception varies across the workspace of the human arm. Here we examined proprioceptive function across a horizontal planar workspace, using a new method that avoids active movement and interactions with other sensory modalities. We systematically mapped both proprioceptive acuity (sensitivity to hand position change) and bias (perceived location of the hand), across a horizontal-plane 2D workspace. Proprioception of both the left and right arms was tested at nine workspace locations and in 2 orthogonal directions (left-right and forwards-backwards). Subjects made repeated judgments about the position of their hand with respect to a remembered proprioceptive reference position, while grasping the handle of a robotic linkage that passively moved their hand to each judgement location. To rule out the possibility that the memory component of the proprioceptive testing procedure may have influenced our results, we repeated the procedure in a second experiment using a persistent visual reference position. Both methods resulted in qualitatively similar findings. Proprioception is not uniform across the workspace. Acuity was greater for limb configurations in which the hand was closer to the body, and was greater in a forward-backward direction than in a left-right direction. A robust difference in proprioceptive bias was observed across both experiments. At all workspace locations, the left hand was perceived to be to the left of its actual position, and the right hand was perceived to be to the right of its actual position. Finally, bias was smaller for hand positions closer to the body. The results of this study provide a systematic map of proprioceptive acuity and bias across the workspace of the limb that may be used to augment computational models of sensory-motor control, and to inform clinical assessment of sensory function in patients with sensory-motor deficits

    Dissecting the Relation between a Nuclear Receptor and GATA: Binding Affinity Studies of Thyroid Hormone Receptor and GATA2 on TSHβ Promoter

    Get PDF
    Background: Much is known about how genes regulated by nuclear receptors (NRs) are switched on in the presence of a ligand. However, the molecular mechanism for gene down-regulation by liganded NRs remains a conundrum. The interaction between two zinc-finger transcription factors, Nuclear Receptor and GATA, was described almost a decade ago as a strategy adopted by the cell to up-or down-regulate gene expression. More recently, cell-based assays have shown that the Zn-finger region of GATA2 (GATA2-Zf) has an important role in down-regulation of the thyrotropin gene (TSH beta) by liganded thyroid hormone receptor (TR). Methodology/Principal Findings: In an effort to better understand the mechanism that drives TSH beta down-regulation by a liganded TR and GATA2, we have carried out equilibrium binding assays using fluorescence anisotropy to study the interaction of recombinant TR and GATA2-Zf with regulatory elements present in the TSH beta promoter. Surprisingly, we observed that ligand (T3) weakens TR binding to a negative regulatory element (NRE) present in the TSH beta promoter. We also show that TR may interact with GATA2-Zf in the absence of ligand, but T3 is crucial for increasing the affinity of this complex for different GATA response elements (GATA-REs). Importantly, these results indicate that TR complex formation enhances DNA binding of the TR-GATA2 in a ligand-dependent manner. Conclusions: Our findings extend previous results obtained in vivo, further improving our understanding of how liganded nuclear receptors down-regulate gene transcription, with the cooperative binding of transcription factors to DNA forming the core of this process.Medical Research Council (MRC), UKConselho Nacional de Desenvolvimento Cientifico e Tecnologico (CNPq), Brazi

    Whole Genome Sequencing Reveals Local Transmission Patterns of Mycobacterium bovis in Sympatric Cattle and Badger Populations

    Get PDF
    Whole genome sequencing (WGS) technology holds great promise as a tool for the forensic epidemiology of bacterial pathogens. It is likely to be particularly useful for studying the transmission dynamics of an observed epidemic involving a largely unsampled ‘reservoir' host, as for bovine tuberculosis (bTB) in British and Irish cattle and badgers. BTB is caused by Mycobacterium bovis, a member of the M. tuberculosis complex that also includes the aetiological agent for human TB. In this study, we identified a spatio-temporally linked group of 26 cattle and 4 badgers infected with the same Variable Number Tandem Repeat (VNTR) type of M. bovis. Single-nucleotide polymorphisms (SNPs) between sequences identified differences that were consistent with bacterial lineages being persistent on or near farms for several years, despite multiple clear whole herd tests in the interim. Comparing WGS data to mathematical models showed good correlations between genetic divergence and spatial distance, but poor correspondence to the network of cattle movements or within-herd contacts. Badger isolates showed between zero and four SNP differences from the nearest cattle isolate, providing evidence for recent transmissions between the two hosts. This is the first direct genetic evidence of M. bovis persistence on farms over multiple outbreaks with a continued, ongoing interaction with local badgers. However, despite unprecedented resolution, directionality of transmission cannot be inferred at this stage. Despite the often notoriously long timescales between time of infection and time of sampling for TB, our results suggest that WGS data alone can provide insights into TB epidemiology even where detailed contact data are not available, and that more extensive sampling and analysis will allow for quantification of the extent and direction of transmission between cattle and badgers

    Genomic analysis of Pseudomonas putida: genes in a genome island are crucial for nicotine degradation

    Get PDF
    Nicotine is an important chemical compound in nature that has been regarded as an environmental toxicant causing various preventable diseases. Several bacterial species are adapted to decompose this heterocyclic compound, including Pseudomonas and Arthrobacter. Pseudomonas putida S16 is a bacterium that degrades nicotine through the pyrrolidine pathway, similar to that present in animals. The corresponding late steps of the nicotine degradation pathway in P. putida S16 was first proposed and demonstrated to be from 2,5-dihydroxy-pyridine through the intermediates N-formylmaleamic acid, maleamic acid, maleic acid, and fumaric acid. Genomics of strain S16 revealed that genes located in the largest genome island play a major role in nicotine degradation and may originate from other strains, as suggested by the constructed phylogenetic tree and the results of comparative genomic analysis. The deletion of gene hpo showed that this gene is essential for nicotine degradation. This study defines the mechanism of nicotine degradation

    Yersinia pestis Evolution on a Small Timescale: Comparison of Whole Genome Sequences from North America

    Get PDF
    Yersinia pestis, the etiologic agent of plague, was responsible for several devastating epidemics throughout history and is currently of global importance to current public heath and biodefense efforts. Y. pestis is widespread in the Western United States. Because Y. pestis was first introduced to this region just over 100 years ago, there has been little time for genetic diversity to accumulate. Recent studies based upon single nucleotide polymorphisms have begun to quantify the genetic diversity of Y. pestis in North America.To examine the evolution of Y. pestis in North America, a gapped genome sequence of CA88-4125 was generated. Sequence comparison with another North American Y. pestis strain, CO92, identified seven regions of difference (six inversions, one rearrangement), differing IS element copy numbers, and several SNPs.The relatively large number of inverted/rearranged segments suggests that North American Y. pestis strains may be undergoing inversion fixation at high rates over a short time span, contributing to higher-than-expected diversity in this region. These findings will hopefully encourage the scientific community to sequence additional Y. pestis strains from North America and abroad, leading to a greater understanding of the evolutionary history of this pathogen

    Sampling and counting genome rearrangement scenarios

    Get PDF
    Even for moderate size inputs, there are a tremendous number of optimal rearrangement scenarios, regardless what the model is and which specific question is to be answered. Therefore giving one optimal solution might be misleading and cannot be used for statistical inferring. Statistically well funded methods are necessary to sample uniformly from the solution space and then a small number of samples are sufficient for statistical inferring

    Murasaki: A Fast, Parallelizable Algorithm to Find Anchors from Multiple Genomes

    Get PDF
    BACKGROUND: With the number of available genome sequences increasing rapidly, the magnitude of sequence data required for multiple-genome analyses is a challenging problem. When large-scale rearrangements break the collinearity of gene orders among genomes, genome comparison algorithms must first identify sets of short well-conserved sequences present in each genome, termed anchors. Previously, anchor identification among multiple genomes has been achieved using pairwise alignment tools like BLASTZ through progressive alignment tools like TBA, but the computational requirements for sequence comparisons of multiple genomes quickly becomes a limiting factor as the number and scale of genomes grows. METHODOLOGY/PRINCIPAL FINDINGS: Our algorithm, named Murasaki, makes it possible to identify anchors within multiple large sequences on the scale of several hundred megabases in few minutes using a single CPU. Two advanced features of Murasaki are (1) adaptive hash function generation, which enables efficient use of arbitrary mismatch patterns (spaced seeds) and therefore the comparison of multiple mammalian genomes in a practical amount of computation time, and (2) parallelizable execution that decreases the required wall-clock and CPU times. Murasaki can perform a sensitive anchoring of eight mammalian genomes (human, chimp, rhesus, orangutan, mouse, rat, dog, and cow) in 21 hours CPU time (42 minutes wall time). This is the first single-pass in-core anchoring of multiple mammalian genomes. We evaluated Murasaki by comparing it with the genome alignment programs BLASTZ and TBA. We show that Murasaki can anchor multiple genomes in near linear time, compared to the quadratic time requirements of BLASTZ and TBA, while improving overall accuracy. CONCLUSIONS/SIGNIFICANCE: Murasaki provides an open source platform to take advantage of long patterns, cluster computing, and novel hash algorithms to produce accurate anchors across multiple genomes with computational efficiency significantly greater than existing methods. Murasaki is available under GPL at http://murasaki.sourceforge.net
    • …
    corecore